A Hierarchical Bitmap Indexing Method for Similarity Search in High-Dimensional Multimedia Databases

نویسندگان

  • Jongho Nang
  • Joohyoun Park
  • Jihoon Yang
  • Saejoon Kim
چکیده

This paper proposes an efficient indexing mechanism for similarity search in highdimensional multimedia database that quickly filter-outs the irrelevant objects using a novel indexing structure, called HBI (Hierarchical Bitmap Index). In this bitmap index, the feature (or attribute) value of object at each dimension is represented with a set of two bits each of which indicates whether it is relatively high (‘11’), low (‘00’), or neither (‘01’) compared to the feature values of other objects at a hierarchical organized interval. This approximation helps to reduce the CPU time of filtering process because many irrelevant objects could be simply excluded by just XORing the bitmaps of two objects. Upon experimental results, we find that there is an optimal number of bitmaps that keeps the filtering rate as high as possible while keeping the search time as short as possible. Furthermore, we also find that the similarity search using the proposed indexing mechanism is about 2-3 times faster than VA-File while guaranteeing the exact solutions.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

یک روش مبتنی بر خوشه‌بندی سلسله‌مراتبی تقسیم‌کننده جهت شاخص‌گذاری اطلاعات تصویری

It is conventional to use multi-dimensional indexing structures to accelerate search operations in content-based image retrieval systems. Many efforts have been done in order to develop multi-dimensional indexing structures so far. In most practical applications of image retrieval, high-dimensional feature vectors are required, but current multi-dimensional indexing structures lose their effici...

متن کامل

Approximate Queries on Set-valued Attributes

Sets and sequences are commonly used to model complex entities. Attributes containing sets or sequences of elements appear in various application domains, e.g., in telecommunication and retail databases, web server log tools, bioinformatics, etc. However, the support for such attributes is usually limited to definition and storage in relational tables. Contemporary database systems don’t suppor...

متن کامل

Hierarchical Bitmap Index: An Efficient and Scalable Indexing Technique for Set-Valued Attributes

Set-valued attributes are convenient to model complex objects occurring in the real world. Currently available database systems support the storage of set-valued attributes in relational tables but contain no primitives to query them efficiently. Queries involving set-valued attributes either perform full scans of the source data or make multiple passes over single-value indexes to reduce the n...

متن کامل

Nonlinear Approximate Indexing for Multimedia Data

This paper presents a new nonlinear approximate indexing method for highdimensional data such as multimedia data. The new indexing method is designed for approximate similarity searches and all the work is performed in the transformed Gaussian space. In this indexing method, we first map the input space into a feature space via the Gaussian mapping, and then compute the top eigenvectors in the ...

متن کامل

An efficient nearest neighbor search in high-dimensional data spaces

Similarity search in multimedia databases requires an efficient support of nearest neighbor search on a large set of high-dimensional points. A technique applied for similarity search in multimedia databases is to transform important properties of the multimedia objects into points of a high-dimensional feature space. The feature space is usually indexed using a multidimensional index structure...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • J. Inf. Sci. Eng.

دوره 26  شماره 

صفحات  -

تاریخ انتشار 2010